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Abstract 

Background: Alloplasmic lines provide a unique tool to study nuclear-cytoplasmic interactions. Three alloplasmic 
lines, with nuclear genomes from Triticum aestivum and harboring cytoplasm from Aegilops uniaristata, Aegilops 
tauschii and Hordeum chilense, were investigated by transcript and metabolite profiling to identify the effects of 
cytoplasmic substitution on nuclear-cytoplasmic signaling mechanisms. 

Results: In combining the wheat nuclear genome with a cytoplasm of H. chilense, 540 genes were significantly 
altered, whereas 1 1 and 28 genes were significantly changed in the alloplasmic lines carrying the cytoplasm of 
Ae. uniaristata or Ae, tauschii, respectively. We identified the RNA maturation-related process as one of the most 
sensitive to a perturbation of the nuclear-cytoplasmic interaction. Several key components of the ROS chloroplast 
retrograde signaling, together with the up-regulation of the ROS scavenging system, showed that changes in the 
chloroplast genome have a direct impact on nuclear-cytoplasmic cross-talk. Remarkably, the H. chilense alloplasmic 
line down-regulated some genes involved in the determination of cytoplasmic male sterility without expressing the 
male sterility phenotype. Metabolic profiling showed a comparable response of the central metabolism of the 
alloplasmic and euplasmic lines to light, while exposing larger metabolite alterations in the H. chilense alloplasmic 
line as compared with the Aegilops lines, in agreement with the transcriptomic data. Several stress-related 
metabolites, remarkably raffinose, were altered in content in the H. chilense alloplasmic line when exposed to high 
light, while amino acids, as well as organic acids were significantly decreased. Alterations in the levels of transcript, 
related to raffinose, and the photorespiration-related metabolisms were associated with changes in the level of 
related metabolites. 

Conclusion: The replacement of a wheat cytoplasm with the cytoplasm of a related species affects the nuclear- 
cytoplasmic cross-talk leading to transcript and metabolite alterations. The extent of these modifications was limited 
in the alloplasmic lines with Aegilops cytoplasm, and more evident in the alloplasmic line with H. chilense cytoplasm. 
We consider that, this finding might be linked to the phylogenetic distance of the genomes. 
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Background 

The genetic information of the eukaryotic organisms is 
divided into a nuclear genome and cytoplasmic genomes 
(mitochondrion and chloroplast, sometimes referred to 
as plasmon). Nevertheless, mitochondrion and chloro- 
plast contain only a few hundred genes, and their func- 
tionality is largely dependent on nuclear genes. This 
condition imposes a complex coordination between nu- 
clear and cytoplasmic gene expression activities to as- 
sure that all mitochondrion and chloroplast proteins are 
timely and correctly formed [1,2]. Alloplasmic lines, 
alien cytoplasm substitution [3], offer a unique oppor- 
tunity to study the nuclear-cytoplasmic interactions and 
the traits whose expression is dependent on the coordin- 
ate activity of nuclear and cytoplasmic genes [4-6]. The 
most relevant cytoplasm-inherited trait is cytoplasmic 
male sterility (CMS) [1], although other traits are also 
known (e.g., insect resistance [7]). 

Alloplasmic lines created using a nuclear genome from 
Triticum spp. and a cytoplasm from Aegilops were the first 
ones created in grasses [8]. Nevertheless, the instability of 
some of the traits and a detrimental phenotype reported 
in the Aegilops-Triticum system has fostered the investiga- 
tion of new cytoplasmic-nuclear systems. Due to its high 
crossability with other members of the Triticeae tribe [9], 
Hordeum chilense Roem et Schultz has been of interest as 
a source of new traits to be transferred to wheat, and it 
has also been suggested as a basis for new cytoplasmic- 
nuclear systems [10-12]. Aside from the possible beneficial 
effect that can be derived from the introgression of cyto- 
plasm genomes from wild species into crops, a detrimental 
effect has also been consistently reported up to this point. 
It was observed that the alloplasmic lines carrying the 
Aegilops spp. or H. chilense cytoplasm performed worse 
than the euplasmic line in regard to many agronomic 
traits [6,13]. 

A non-natural combination of nuclear and cytoplasm 
genomes results in an essential alteration of the gene 
balance that infringes on the adaptability of the new 
genotype and brings about changes in quantitative traits 
and biological functions under the influence of mito- 
chondrial and chloroplast genomes [14], which, in turn, 
can have an adverse effect on the responses to biotic and 
abiotic stresses [15]. The chloroplast and mitochondrial 
genomes work in coordination with the nuclear genome 
to support their functions and their adaptation to the 
changeable environmental conditions [16]. Conversely, 
alteration of the cytoplasm genome has a relevant im- 
pact on nuclear gene expression. For instance, many nu- 
clear genes were found altered in a Brassica napus CMS 
line, particularly those encoding proteins involved in the 
organelles protein import machinery and the genes 
expressed in stamens and pollens, as well as the genes 
implicated in cell- wall remodeling or architecture [17]. 



Although substantial work has been carried out to inves- 
tigate gene expression in alloplasmic lines [18-20], a ma- 
jority of these studies were devoted to genes associated 
with floral organ development in CMS Brassicaceae 
[21]. As such, the absence of published comparative 
studies investigating the alteration in transcript and me- 
tabolite profiles, due to a foreign cytoplasm, limits the 
understanding of nuclear-cytoplasmic interaction and its 
output on the cellular processes. The central role of the 
chloroplast and mitochondria on plant cell metabolism 
and the known potential of CMS in crop breeding call 
for a comprehensive understanding of the global changes 
occurring as a result of foreign cytoplasm integration. 

Very often the analysis of nuclear-cytoplasmic inter- 
action is carried out by means of nuclear mutants [22], 
on the contrary, differences detectable between an allo- 
plasmic line vs. the corresponding euplasmic can reveal 
the impact of small variations of the cytoplasm genome 
on the nuclear-cytoplasmic cross-talk. This study was 
carried out to assess the impact of alloplasmic lines on 
cell functionality. Parallel transcript and metabolite pro- 
files were performed on three wheat alloplasmic lines 
obtained by cytoplasmic genome introgression from wild 
species (Ae. uniaristata, Ae. tauschii and H. chilense) 
into cultivated wheat and in their respective euplasmic 
genotypes. The effect that phylogenetic relatedness has 
on gene expression and metabolic regulation of common 
wheat is discussed. We hypothesize that, in order to 
maintain its functionality, the cell must integrate the 
perturbation caused by a non-natural nuclear-cytoplasmic 
interaction and reorganize its regulatory system and me- 
tabolism accordingly. 

Results 

The substitution of a wheat cytoplasm with an alien 
cytoplasms has an impact on the nuclear transcriptome 

Three wheat alloplasmic lines and the corresponding 
euplasmic controls were subjected to transcriptome ana- 
lysis to assess the effect of cytoplasm substitution on gene 
expression. The alloplasmic lines T183 and T195 were de- 
veloped through the introgression of the cytoplasm from 
Ae. uniaristata (T183) and Ae. tauschii (T195) in the nu- 
clear background of T. aestivum cv. Chris by Prof. S.S. 
Maan [23]. The alloplasmic line TH237 was produced by 
introgressing the H. chilense accession H7 cytoplasm into 
the nuclear background of T. aestivum accession T20 [13]. 
The seeds of the alloplasmic lines were multiplied at IAS- 
CSIC and verified for cytoplasm identity with specific 
cytoplasm markers. T183, T195 and TH237 were selected, 
based on published data [13,24], in order to represent the 
largest cytoplasmic diversity compatible with an overall 
normal phenotype (no evidence of development or flower 
abnormalities). A global gene expression analysis was per- 
formed on RNA samples isolated from the fully expanded 
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leaves of two-week old plants grown at 600 |iE m' 2 s' 1 . 
Array hybridizations were carried out using the Affymetrix 
Wheat Genome Array. A Principal Component Analysis 
(PCA, a mathematical representation of the dataset diver- 
sity [25]), performed on the transcriptome dataset, identi- 
fied two main components that explain 29.5% and 9.6% of 
the variance (Figure 1). The first component separated the 
alloplasmic lines developed in the genetic background of 
the accession T20 from those developed in the genetic 
background of the cultivar Chris. The second component 
allowed for the separation of the alloplasmic line TH237 
(H. chilense cytoplasm) from the corresponding euplasmic, 
but not the separation of the other two alloplasmic lines 
(Aegilops spp. cytoplasm) from the cv. Chris. The PCA re- 
sults suggest the existence of important differences be- 
tween the two genetic backgrounds T20 and Chris; 
therefore, all the findings of this work are based only on 
the comparisons between each alloplasmic line and its 
corresponding euplasmic. 

A Welch t-test, using a two-fold change cut-off and a 
false discovery rate correction for multiple testing, iden- 
tified 562 probe sets, each of them representing a puta- 
tive gene, differentially modulated in at least one 
experimental comparison (Additional file 1: Table SI). 
When the wheat nuclear genome was combined with 



the cytoplasm of H. chilense, 540 genes significantly 
altered their expression (295 up- and 245 down- 
regulated), whereas 11 (2 up- and 9 down-regulated) and 
28 (5 up- and 23 down-regulated) genes were significandy 
modulated in the alloplasmic lines carrying the cytoplasm 
of Ae. uniaristata and Ae.tauschii, respectively. Two 
up-regulated (Ta.24245.1.Al_at and TaAffx.57297.1.Sl_at 
annotated as maturase with chloroplast and mitochondrial 
localization) and five down-regulated genes (TaAffx.6196. 
2.Sl_s_at and TaAffx.6196.1.Sl_atannotated as cyt b 55 9, 
TaAffx.l28795.10.Sl_x_at annotated as ribosomal protein 
rpsl2, TaAffx.l28617.1.Sl_x_at annotated as NADPH- 
quinone oxidoreductase and TaAffx.l28795.12.Sl_x_atan- 
notated as ABC transporter permease) were common in 
all comparisons (Figure 2), underlining some common re- 
sponses across all alloplasmic lines. Significandy, the five 
down-regulated sequences were all encoded by the chloro- 
plast genome. 

The up- and down-regulated genes were divided into 
classes according to their expression behavior and classi- 
fied using wheat gene identifier categories of the MapMan 
software [26]. Figure 3 summarizes the metabolism over- 
view of up- and down- regulated genes in the H. chilense 
alloplasmic line vs. the corresponding euplasmic control 
according to the MapMan classification. The results 
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Figure 1 PCA plots of wheat alloplasmic array hybridization data. The samples have been represented according to the variance explained 
by the two principal component (29.5% and 9.6% on x and y axis, respectively) variations detected through transcriptome comparison. Biologica 
replicates of each sample are surrounded by lines of different colors.TH237: alloplasmic line with H. chilense cytoplasm; T20: wheat accession 
representing the euplasmic control for TH237; T1 83: alloplasmic line with Ae. uniaristata cytoplasm; T1 95: alloplasmic line with Ae. tauschii 
cytoplasm; Chris: wheat cultivar representing the euplasmic control forT183 and T195. 
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Figure 2 Representation of up- and down-regulated genes. The Venn diagrams show the genes up- and down-regulated in the alloplasmic 
lines developed with the cytoplasm of Ae. uniaristata (T183), Ae. tauschii (T195) and H. chilense (TH237) in comparison with the corresponding 
euplasmic controls. A single gene, up-regulated in some comparisons and down-regulated in others, has been excluded in this analysis. 



showed that the genes associated with mitochondrial elec- 
tron transport, photosynthesis, redox and hormones were 
generally up-regulated, while the genes associated with 
lipid metabolism (above all, lipid metabolism in pollen de- 
velopment), secondary metabolism, and signaling were 
mainly down-regulated. 



The analysis of the sub-cellular localization of the pro- 
teins, encoded by the genes that modulated in response to 
cytoplasm substitution, identified 65 genes coding for 
plastid-targeted proteins and 22 genes coding for 
mitochondria-targeted proteins (15.2% of the 562 genes 
modulated in the experiment, Additional file 1: Table SI). 
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Figure 3 MapMan classification of differentially expressed genes. MapMan Metabolism overview maps showing the differences in transcript 
levels between the alloplasmic line TH237 {H. chilense cytoplasm) and the T20 euplasmic control [26]. The log 2 expression ratios of genes altered 
in the comparison were loaded into the MapMan Image Annotator Module to map the wheat transcriptome data to define functional categories 
and to display them onto pathway diagrams. In the color scale, red and blue represent decreased and increased gene expression, respectively, in 
the H. chilense alloplasmic line as compared with the corresponding T20 euplasmic line. 
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Among them, 18 plastid and 11 mitochondrial-localized 
proteins were encoded by cytoplasmic genomes. 

A validation experiment was carried out with six dif- 
ferentially expressed genes (AtnMat3, Executer2, Flu, 
psbM, FAR, LSD1, Additional file 2: Table S2) selected 
for their role in the chloroplast-retrograde signaling 
pathway (see below for their descriptions). The tested 
genes showed a value of 0.78 of the Pearson correlation 
coefficient between array and qRT-PCR expression 
values with an expression trend conserved across the 
samples between array and qRT-PCR results (detailed 
results are reported in Additional file 2: Table S2). 

A significant proportion of the genes that were altered 
in response to the substitution of a wheat cytoplasm 
with an alien cytoplasm can be traced back to specific 
cellular processes or metabolic pathways. The most rele- 
vant ones are described in the next sections. 

RNA maturation-related processes in the organelles are 
sensitive to perturbations of the nuclear-cytoplasmic 
cross-talk 

The nucleus controls organelle gene expression through 
a number of nuclear-encoded factors that act post- 
transcriptionally on specific organelle target transcripts 
(anterograde signaling pathway [27]) and that regulate 
the maturation, stability and/or translation of organelle 
transcripts. RNA maturation in mitochondrion and chloro- 
plast, including RNA splicing, is highly dependent on 
nuclear-encoded splicing factors. A general consequence of 
the perturbation induced in the alloplasmic lines was a 
strong modulation of the nuclear genes coding for 
organelle-targeted maturases. The probe set TaAffx.57297. 
l.Sl_at homologous to AtnMat3, a maturase with demon- 
strated mitochondrion localization [28], was up-regulated 
with a log2 fold change of 5.3, 3.7, and 1.6 in alloplas- 
mic lines carrying Ae. uniaristata, Ae. tauschii and 

H. chilense cytoplasm, respectively. Similarly, Ta.24245. 

I. Al at homologous to AtnMat4, localized in both 
mitochondrion and chloroplast [28], was also up- 
regulated with a log2 fold change of 6.5, 5. land 3.0 in 
the same comparisons indicated above. 

Although the targets of AtnMat3 and AtnMat4 are still 
unknown, it should be noted that the RNA coding for five 
chloroplast encoded genes, cyt b 55 9 (two probe sets), ribo- 
somal protein rpsl2, NADPH-quinone oxidoreductase and 
ABC transporter permease, were down-regulated in all 
alloplasmic lines, an expression trend that suggests a pos- 
sibly functional link between the up-regulation of nuclear- 
encoded maturase genes and chloroplast expression. 

Furthermore, the wheat chip array carried a probe set 
(TaAffx.l09850.1.Al_s_at) annotated as chloroplast matur- 
ase K (MatK) in Plexdb (http://www.plexdb.org/), an essen- 
tial component for the splicing of plastid RNAs. This probe 
set was down-regulated in the alloplasmic line carrying the 



H. chilense cytoplasm. The plastidial MatK is associated 
with several chloroplast genes with class II introns [29]; one 
of them, the mRNA precursor coding for ribosomal protein 
rpl2 (TaAffx. 128896.8. Al_at), was also down-regulated with 
a log2 fold change of -3.3 in the same alloplasmic line. 

The substitution of a wheat cytoplasm with an alien 
cytoplasms alters the chloroplast retrograde signaling 

To coordinate nuclear gene expression with the functional 
or metabolic state of plastids, the plant cells have acquired 
retrograde signaling pathways from plastid to nucleus, in- 
cluding redox signaling, Mg-protoporphyrin IX (an inter- 
mediate of the tetrapyrrole biosynthetic pathway), signals 
that are generated by inhibiting plastid gene expression or 
by accumulating various reactive oxygen species (ROS), 
such as hydrogen peroxide (H2O2), singlet oxygen ^C^), 
hydroxyl radical (OH) and superoxide anion (O2) [30]. 
Excess light can potentially regulate nuclear gene expres- 
sion by affecting tetrapyrrole biosynthesis. Plants exposed 
to high light, as was the condition for the transcriptomic 
experiment here reported, require specific adaptations that 
are mediated by chloroplast retrograde signaling [30]. 
Therefore, we reasoned that alterations in this signaling 
pathway should be evident in the experimental dataset, 
and a specific mining work was dedicated to finding 
genes that represent the key components of this signal- 
ing with an altered expression in response to a cyto- 
plasm substitution. 

HEMA1 encodes one of the three isoforms of the 
glutamyl-tRNA reductase, which catalyzes the first key 
regulatory step of the tetrapyrrole biosynthesis, and 
three probe sets (Ta.3243.1.Sl_at, Ta.3243.1.Sl_x_at and 
TaAffx.8262.1.Sl_x_at) corresponding to HEMA1 were 
up-regulated in the alloplasmic line carrying the cyto- 
plasm of H. chilense. It is known that the accumulation 
of chlorophyll precursors, such as Mg-Protoporphyrin 
IX, may stimulate the expression of the nuclear gene 
HSP70 [31]. Importantly, in the H. chilense alloplasmic 
line, the up-regulation of HEMA1 was associated with a 
parallel up-regulation of HSP70-1 (Ta.24154.1.Sl_s_at). 

Low concentrations of ROS act as signaling molecules 
for a number of regulated processes during plant growth 
and development [32], as well as in response to a variety 
of environmental stimuli [33,34]. The existence of a 
specific 1 0 2 signaling pathway has been shown in 
Arabidopsis through a mutant analysis of flu (fluorescent 
in blue light) [35] and its regulators named executer 
(exiand ex2) [36,37]. Three probe sets (Ta.24301.1. 
Al_x_at, TaAffx.l28618.1.Sl_at andTa.24301.1.Al_s_at), 
corresponding to Flu, and one probe set (TaAffx.110498.1. 
Sl_at), corresponding toEx2, were all up-regulated in the 
alloplasmic line carrying the H. chilense cytoplasm, in 
comparison with the corresponding euplasmic. Further- 
more, the same alloplasmic line was characterized by an 
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up-regulation of three probe sets (Ta. 28069. l.Al_at, 
Ta.6594.1.Sl_at and Ta.6594.1.Sl_s_at), corresponding to 
LSD1 (LESION SIMULATING DISEASE 1), a putative 
regulator of the hypersensitive response and light acclima- 
tion driven by ROS accumulation [38]. Notably, the 
up-regulation of LSD1 was associated with an altered 
expression of 26 probe sets involved in the hypersen- 
sitive response to biotic stress belonging to the path- 
way of methyl jasmonate and salicylic acid (Table 1). 
Among these sequences, there were enzymes involved in 
the synthesis of salicylic acid (isochorismate synthases, 
one probe set), the WRKY70 transcription factor (two 
probe sets), lipoxigenase 2 (three probe sets), DIR1 
(Defective Induced Resistance 1, a gene involved in the 
systemic acquired resistance -SAR, two probe sets), FMOl 
(Flavin-dependent Monoxigenase 1, an essential compo- 
nent of SAR, three probe sets) and several other pathogen 
responsive proteins. Furthermore, in the H. chilense 
alloplasmic line, we found several down-regulated 
genes involved in programmed cell death: ACD1 
(Accelerated Cell Dead 1, TaAffx.67544.1.Sl_at), STP13 
(Sugar Transport Protein 13, Ta.5766.1.Sl_at), VEP1 
(Vein Patterning 1, Ta. 13682. l.Al_at) and two probe 
sets annotated as putative metacaspases (Ta.3154.1. 
Sl_at and Ta.l0581.1.Al_at) [39-42]. 

Since Flu, Ex and LSD1 are key components of the ROS 
chloroplast retrograde signaling [37,43,44], the identifica- 
tion of these genes, among those altered in the H. chilense 
alloplasmic line, as well as of other downstream genes, 
suggests that a change in the chloroplast genome has a 
direct impact on the chloroplast retrograde signaling 
mediated by 1 C>2. 

The substitution of a wheat cytoplasm with an alien cytoplasm 
alters the expression of photosynthetic components 

Retrograde signaling is known to control the expression of 
many genes coding for components of the PSII, and in- 
deed, a number of these genes, mostly encoded by the 
chloroplast genome, were changed in the alloplasmic lines 
(Table 2 and Additional file 1: Table SI). Two genes (probe 
sets TaAffx.6196.1.Sl_at and TaAffx.6196.2.Sl_s_at) cod- 
ing for the cytochrome b 559 alpha chain (psbE) were 
down-regulated in all alloplasmic lines, while a gene cod- 
ing for a component of the psbM protein (TaAffx.48564.2. 
Sl_s_at) was strongly down-regulated in the H. chilense 
alloplasmic line (log2 fold change of -7.9). Furthermore, 
eleven genes coding for chlorophyll a/b-binding protein 
(LHCII), five genes related to LT29/APX4 (a protein asso- 
ciated with photosystem II [45] and a gene coding for a 
Dl precursor (TaAffx.l28757.1.Sl_at) were up-regulated 
in the alloplasmic line carrying the H. chilense cytoplasm. 
Lastly, the alloplasmic line carrying the Ae. tauschii cyto- 
plasm showed a down-regulation of a gene encoding a 
psbX-related protein (Ta.20878_at). 



NADH-plastoquinone oxidoreductase chain 5 (TaAffx. 
128617.1.Sl_x_at), a component of the chloroplast 
NADH-ubiquinone oxidoreductase, was down-regulated 
in all alloplasmic lines. Furthermore, a probe set, corre- 
sponding to ATP synthase (TaAffx.l28565.9.Sl), was 
up-regulated in the alloplasmic line with the H. chilense 
cytoplasm. 

To conclude, the findings described above highlight 
that the substitution of the wheat cytoplasm with the 
cytoplasm of H. chilense, impacts the expression of sev- 
eral nuclear genes encoding photosynthetic components, 
suggesting a modification of the nuclear-chloroplast 
cross-talk, at least under high light conditions. 

The substitution of a wheat cytoplasm with an alien 
cytoplasm modifies the expression of genes coding for 
components of the mitochondrial electron transport chains 

The mitochondrial electron transport chain is comprised 
of four large protein complexes (I, II, III and IV) that 
interact with each other via the small lipid ubiquinone 
and the small protein cytochrome c [46]. Three probe sets 
(Ta.24230.1.Sl_at, Ta.28812.1.Sl_at and TaAffx.87200.1. 
Sl_at) related to NADH dehydrogenase (EC 1.6.5.3, com- 
plex I), the first step of the oxidative phosphorylation, had 
an altered expression profile in the alloplasmic lines. The 
first two genes were up-regulated in the alloplasmic line 
with the H. chilense cytoplasm, while the last one was up- 
regulated in the alloplasmic lines with the Ae. uniaristata 
and the H. chilense cytoplasms. Furthermore, four probe 
sets (TaAffx.69955.1.Sl_at, Ta.28701.1.Al_at, TaAffx.ll 
4431.1.Sl_s_at and Ta.28645.1.Sl_at), related to different 
subunits of the cytochrome c oxidase (COX; EC 1.9.3.1, 
complex IV) [45], were up-regulated in the alloplasmic 
line with the H. chilense cytoplasm. TaAffx.4544.2.Sl_s_at, 
involved in cytochrome c oxidase biogenesis [47], was, 
instead, down-regulated in the Ae. tauschii and H. chilense 
alloplasmic lines. Notably, most of the genes described in 
this paragraph are encoded by the mitochondrial genome 
(Additional file 1: Table SI). 

The expression of genes involved in cytoplasm male 
sterility is dependent on the interaction between nuclear 
and cytoplasm genomes 

The transcriptome comparison of the alloplasmic line, 
carrying the cytoplasm of H. chilense with its euplas- 
mic control, highlighted the alteration of several genes 
known to be involved in the determination of cyto- 
plasm male sterility (CMS). Three up-regulated probe 
sets (Ta.l967.1.Sl_x_at, TaAffx.l04812.1.Sl_s_at and 
Ta.l967.2.Al_x_at) were related to LOX2 (chloroplast 
13-lipoxygenase), while two down-regulated probe sets 
(Ta.8097.1.Al_at and Ta.l3650.1.Al_at) were anno- 
tated as LOX3 (chloroplast lipoxygenase). LOX2 and 
LOX3 play a role in the biosynthesis of jasmonate, and 
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Table 1 Probe sets encoding proteins involved in biotic stress responses significantly up- or down-regulated in the 
alloplasmic line TH237 carrying the H. chilense cytoplasm compared to the corresponding euplasmic line T20 



Probe set ID 

Ta.24195.3.S1_at 

Ta.24195.1.A1_at 

TaAffx.55225.1.S1_at 

TaAffx.94000.1.S1_at 

Ta.21533.1.A1_at 

Ta.21314.1.S1_at 

Ta.21314.1.S1_x_at 

Ta.22687.1.A1_at 

TaAffx.28302.2.S1_at 

TaAffx.78606.1.S1_at 

TaAffx.1 07480.1. S1_at 

Ta.1 9859.1. SI at 



TaAffx.1 04885.1. S1_at 

TaAffx.1 04885.2.S1_at 

TaAffx.1 31 61 1.1. S1_at 

TaAffx.1 31 61 1.4.S1_at 

Ta.1 967.1. S1_x_at 

TaAffx.1 0481 2.1 .SI _s_at 

Ta.1967.2.A1_x_at 

Ta.8097.1.A1_at 
Ta.13650.1.A1_at 
Ta.21353.2.S1_at 
Ta.8614.1.S1_at 

Ta.8614.2.S1_x_at 

TaAffx.37536.1.S1_at 
Ta.22221.1.Sl a at 



Log2 fold change 

+ 1.19 
+ 1.99 
+ 1.50 
-1.40 
+2.18 
-1.62 
-1.74 
-1.75 
-1.00 
-1.81 
-1.61 

+ 1.36 



+ 1.61 

+ 1.69 

+4.29 

+2.76 

+ 1.59 

+ 1.63 

+ 1.64 

+ 1.02 
+ 1.93 
-1.01 
-1.30 

-1.66 

+ 1.03 
-1.51 



Probe set description 

Vrgal -disease resistance gene homologues 
Vrgal -disease resistance gene homologues 

Leucine-rich repeat family protein similar to disease resistance family protein 

Barley stem rust resistance protein 

Powdery mildew resistance protein 

Disease resistance response protein (PR protein) 

Disease resistance response protein (PR protein) 

Disease resistance response protein (PR protein) 

Disease resistance-responsive family protein (PR protein) 

Disease resistance protein (NBS-LRR class -PR protein) 

Cold induced ice recrystallisation inhibition protein. Phytohormones involved in 
pathogen defense pathways (jasmonic acid and ethylene) 

FM01 (FLAVIN-DEPENDENT MONOOXYGENASE 1); mono-oxygenase FM01 is required 
for full expression of TIR-NB-LRR conditioned resistance to avirulent pathogens and 
for basal resistance to invasive virulent pathogens. Functions in an EDS1 -regulated but 
SA-independent 

FM01 (FLAVIN-DEPENDENT MONOOXYGENASE 1); mono-oxygenase FM01 is required 
for full expression of TIR-NB-LRR conditioned resistance to avirulent pathogens and for 
basal resistance to invasive virulent pathogens. Functions in an EDS1 -regulated but 
SA-independent 

FM01 (FLAVIN-DEPENDENT MONOOXYGENASE 1); mono-oxygenase FM01 is required 
for full expression of TIR-NB-LRR conditioned resistance to avirulent pathogens and for 
basal resistance to invasive virulent pathogens. Functions in an EDS1 -regulated but 
SA-independent 

DIR1 (DEFECTIVE IN INDUCED RESISTANCE 1); lipid binding encodes a putative apoplastic 
lipid transfer protein that is involved in systemic acquired resistance (SAR). Mutants in this 
gene exhibit wild-type local resistance to avirulent and virulent Pseudomonas 

DIR1 (DEFECTIVE IN INDUCED RESISTANCE 1); lipid binding encodes a putative apoplastic 
lipid transfer protein that is involved in systemic acquired resistance (SAR). Mutants in this 
gene exhibit wild-type local resistance to avirulent and virulent Pseudomonas 

LOX2 (LIPOXYGENASE 2) Chloroplast lipoxygenase required for wound-induced jasmonic 
acid accumulation in Arabidopsis. Mutants are resistant to Staphylococcus aureus and 
accumulate salicylic acid upon infection. Identical to Lipoxygenase, chloroplast precursor 

LOX2 (LIPOXYGENASE 2) Chloroplast lipoxygenase required for wound-induced jasmonic 
acid accumulation in Arabidopsis. Mutants are resistant to Staphylococcus aureus and 
accumulate salicylic acid upon infection. Identical to Lipoxygenase, chloroplast precursor 

LOX2 (LIPOXYGENASE 2) Chloroplast lipoxygenase required for wound-induced jasmonic 
acid accumulation in Arabidopsis. Mutants are resistant to Staphylococcus aureus and 
accumulate salicylic acid upon infection. Identical to lipoxygenase, chloroplast precursor 

LOX3 (Lipoxygenase 3); iron ion binding/lipoxygenase 

Lipoxygenase, putative similar to LOX3 (Lipoxygenase 3), iron ion binding/lipoxygenase 
Putativesalicylic acid-binding protein PP 

WRKY70 (WRKY DNA-binding protein 70); transcription factor member of WRKY Transcription 
Factor; Group III. Function as activator of SA-dependent defense genes and a repressor of 
JA-regulated genes. WRKY70-controlled suppression of JA-signaling is partly executed by NPR1. 

WRKY70 (WRKY DNA-binding protein 70); transcription factor member of WRKY Transcription 
Factor; Group III. Function as activator of SA-dependent defense genes and a repressor of 
JA-regulated genes. WRKY70-controlled suppression of JA-signaling is partly executed by NPR1. 

Isochorismate synthases involved in salicylic acid synthesis 

ADC2 (ARGININE DECARBOXYLASE 2) encodes an arginine decarboxylase (ADC), a rate-limiting 
enzyme that catalyzes the first step of polyamine (PA) biosynthesis via ADC pathway in 
Arabidopsis thaliana. 
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Table 2 Probe sets encoding components of the photosynthetic apparatus significantly up- or down-regulated in the 
three alloplasmic lines compared with the corresponding euplasmic line 



Probe set ID 



Log2 fold change 
alloplasmic T183 vs. 
euplasmic Chris 



Log2 fold change 
alloplasmic T195 vs. 
euplasmic Chris 



Log2 fold change 
alloplasmic TH237 vs. 
euplasmic T20 



Probe set description 



TaAffx.6196.1.S1_at 
TaAffx.6196.2.S1_s_at 
TaAffx.48S64.2.S1_s_at 
Ta.20878.1.S1_at 

Ta.22984.1.S1_x_at 

Ta.28496.1.A1_at 

Ta.28496.1.A1_x_a 

Ta.29587.2.S1_x_at 

Ta.29587.3.A1_at 

Ta.30702.1.S1_x_at 

Ta.3249.1.S1_at 

Ta.3249.2.S1_x_at 

Ta.3249.3.A1„at 

Ta.3795.l.S1_x_at 

Ta Affx.1 1 41 27. 1 .SI _x_at 

Ta.488.1.S1_at 

Ta.488.1.S1_x_at 

Ta.488.2.Sl_at 

Ta.488.3.Sl_a_at 

Ta.488.3.S1_at 

TaAffx.1 28757.1. S1_at 

TaAffx.1 2861 7.1. S1_x_at 

TaAffx.1 28565.9.S1_at 
Ta.25398.1.S1_at 

TaAffx.1 2841 8. 14.A1_at 

Ta.26997.1.S1 at 



-5.81 
-5.92 



-5.65 
-5.63 



-1.07 



-5,14 



-4,65 



-5.26 
-4.96 
-7.93 



+2.61 
+2.54 
+1.33 
+3.23 
+1.99 
+341 
+1.72 
+347 
+1.13 
+1.83 
+1.50 
+1.51 
+1.67 
+1.54 
+148 
+1.48 
-4.62 

+4.98 
-1.10 

-1.26 

-1.12 



Cytochromeb559 alpha chain; psbE 

Cytochromeb559 alpha chain; psbE 

Photosystem II M protein; psbM 

Protein Photosystem II reaction centre 
X protein (PsbX), putative 

Chlorophyll a/b-binding protein (LHCII 

Chlorophyll a/b-binding protein (LHCII 

Chlorophyll a/b-binding protein (LHCII 

Chlorophyll a/b-binding protein (LHCII 

Chlorophyll a/b-binding protein (LHCII 

Chlorophyll a/b-binding protein (LHCII 

Chlorophyll a/b-binding protein (LHCII 

Chlorophyll a/b-binding protein (LHCII 

Chlorophyll a/b-binding protein (LHCII 

Chlorophyll a/b-binding protein (LHCII 

Chlorophyll a/b-binding protein (LHCII 

APX4 (ASCORBATE PEROXIDASE 4) 

APX4 (ASCORBATE PEROXIDASE 4) 

APX4 (ASCORBATE PEROXIDASE 4) 

APX4 (ASCORBATE PEROXIDASE 4) 

APX4 (ASCORBATE PEROXIDASE 4) 

Photosystem II protein D1 precursor 

Chloroplast (NAD(P)H dehydrogenase, 
chain 5 

Protein ATPase I subunit 

ELIP1 (early light-inducible protein); 
chlorophyll binding 

ELIP1 (early light-inducible protein); 
chlorophyll binding 

Putative yearly light-inducible protein 



both are involved in the determination of male sterility 
[48]. LOXes catalyze the addition of oxygen to polyun- 
saturated fatty acids to produce the unsaturated fatty 
acid hydroperoxide. Linoleic and linolenic acids are 
the most common substrates for LOX [49], which fa- 
vors free fatty acids to sterified ones [50]. LOXes are 
also involved in oxylipin biosynthesis during stress, a 
process mainly associated with wounding and patho- 
gen attack [51]. 

Lipid metabolism is essential for normal pollen develop- 
ment [52,53]. A starting point in fatty-acid synthesis is the 
conversion of acetyl-CoA into malonyl-CoA. Six down- 
regulated probe sets (Ta.l2415.3.Sl_at, Ta.23988.1.Al_at, 
Ta.26144.1.Al_at, Ta.26144.1.Al_s_at; TaAffx.114108.1. 
Sl_at and TaAffx.38271.1.Al_at) are all related to fatty 



acyl-CoA reductase (FAR), a putative male sterility protein 
involved in wax biosynthesis [54]. Furthermore, a probe 
set (Ta.l6047.1.Sl_at), corresponding to male sterility 2 
(ms2, chloroplastic fatty acid reductase), and two probe 
sets (Ta.l456.1.Al_at and TaAffx.38271.1Al_at), related 
to the CER4, both involved in the synthesis of the lipids of 
the outer pollen wall [55], were down-regulated in the 
alloplasmic line with the H. chilense cytoplasm; these two 
genes belong to a family of genes encoding FAR enzymes. 
In the same sample, three additional probe sets (Ta.635.2. 
Al_at, Ta.635.2.Al_x_at and Ta.635.3.Sl_a_at), related to 
POP2 [pollen-pistil incompatibility) [56], an enzyme of the 
GABA-shunt involved in the conversion of the non- 
protein amino acid GABA to succinyl-semi-aldehyde [57], 
were also down-regulated, while a probe set (Ta.20591.2. 
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Sl_a_at), related to plantacyanin involved in anther devel- 
opment and pollination, was up-regulated. 

GC-MS analysis reveals the impact nuclear-cytoplasmic 
interaction has on leaf metabolism 

Considering that the nucleus-chloroplast cross-talk is es- 
sential for the adaptation of the leaf metabolism to envir- 
onmental variations, GC-MS-based metabolite profiling 
was done on the alloplasmic lines and corresponding 
euplasmic controls grown at two light intensities (low light 
150 uE m' 2 s' 1 , and high light 600 uE m' 2 s' 1 ) to test if a 
substitution of the cytoplasm impairs light response. 

To acquire an overview of the global changes in leaf me- 
tabolism across genotypes, and conditions, the GC-MS 
dataset was subjected to PCA [25]. The analysis revealed a 
clear separation of the samples due to light effect on PCI 
(Figure 4A and B) in both euplasmic backgrounds, ac- 
counting for more than 60% of the variance. These results 
suggest a comparable response to light intensity of the 
alloplasmic lines and their respective euplasmic controls, 
implying relatively functional mechanisms of electron dis- 
sipation and chloroplast functioning. 

Nevertheless, substituting the wheat cytoplasm with the 
H. chilense cytoplasm led to a perturbation of the central 
metabolism in response to high light (HL), resulting in a 
separation of the euplasmic control from the H. chilense 
alloplasmic line with the PCA not observed under low 
light (LL) (Figure 4A). On the genetic background of 
Chris, the alloplasmic line carrying the Ae. uniaristata 
cytoplasm showed a slight separation from the euplas- 
mic control under LL, while separating from the Ae. 



tauschii alloplasmic line. Under HL, the alloplasmic 
line carrying the Ae. tauschii cytoplasm grouped to- 
gether with the euplasmic control and separated from 
the other alloplasmic line (Figure 4B), implying the ex- 
istence of processes that affect the central metabolism 
in a species-specific manner. 

The alteration in the metabolite profiles in response to 
light intensity was validated by a two-way ANOVA, separ- 
ately testing the factors of cytoplasmic substitution, light 
treatment and their interaction (Additional file 3: Table S3 
and Additional file 4: Table S4). As expected, light was 
confirmed as having a major impact on the metabolite 
profiles of all genotypes (Additional file 3: Table S3 and 
Additional file 4: Table S4). Substitution in T20 with 
the H. chilense cytoplasm led to the alteration in the 
abundance of 33% (17) of the annotated metabolites, 
while the interaction arising between genotype and 
light accounted for the change in 31% (16) of metabo- 
lites (Additional file 3: Table S3). In the Chris series, 
cytoplasm substitution caused a significant change in 
the abundance of 22% (11) and 24% (12) metabolites in 
alloplasmic lines with Ae. uniaristata and Ae. tauschii 
cytoplasm, respectively. The interaction between geno- 
type and light treatment caused 16% (8) and 33% (17) 
of metabolites to be significantly altered in alloplasmic 
lines with Ae. uniaristata and Ae. tauschii cytoplasm, 
respectively (Additional file 4: Table S4). The average 
extent of the change in metabolite abundance was also 
dependent on the nuclear-cytoplasmic interaction. For 
example, the average fold change in the H. chilense 
cytoplasm-substituted plants was 2.6-fold, while in the 
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Figure 4 PCA plots of the wheat alloplasmic metabolic dataset. The metabolic dataset of leaves from plants grown at 150 (LL) and 600 (HL) 
uE rrf 2 s is shown. Each spot on the plot is a representative of a sample (that is, five biological samples for each category) A: PCA of alloplasmic 
TH237 (H. chilense cytoplasm) vs. T20 euplasmic control. The rectangles and circles represent low light and high light, respectively. The white and 
grey colors represent T20 and TH237, respectively. B: PCA of alloplasmic lines T1 83 (fie. uniaristata cytoplasm) and T1 95 (fie. tauschii cytoplasm) in 
comparison with the wheat cv. Chris used as the euplasmic control. Rectangles and circles represent low light and high light, respectively. White, 
grey and black denote Chris, T183 and T195, respectively. 
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Chris series, the average fold change in metabolite 
abundance was 0.5 and 1.9-fold in Ae. uniaristata and 
Ae. tauschii, respectively. 

Substitution of a wheat cytoplasm with a distant relative H. 
chilense cytoplasm alters leaf metabolic response to light 

Under LL, the comparison between the alloplasmic line 
carrying the cytoplasm from H. chilense and the corre- 
sponding euplasmic revealed significant variations in 
the abundance of amino acids (aspartate, glycine, glu- 
tamate and homoserine), sugars (arabinose, glucose and 
glucose 1, 6 anhydro-|3), sugar phosphates, mannitol, 
myo-inositol, organic acids (mainly citrate, malate, suc- 
cinate and itaconate) and putrescine (a stress-related 
polyamine) [58,59] (Figure 5, Additional file 5: Table S5). 
Markedly, in the alloplasmic line, homoserine, an 
intermediate in the synthesis of the aspartate-derived 
amino acids [60], was reduced to 10% of its content 
in the euplasmic control. A three-step pathway converts 
L-aspartate into homoserine; hence, the accumulation 
of aspartate and the low homoserine level might suggest 
a compromised step in the biosynthesis. The coordi- 
nated up-regulation of myo-inositol pathway-related 
genes was shown under environmental stresses [61], and 
often, the role of these metabolites is reported in the lit- 
erature to be associated with stress, as they serve as 
osmoprotectants [62]. 

Ae. uniaristata and Ae. tauschii cytoplasm substitu- 
tions showed milder effects on the metabolism of their 
euplasmic lines, compared to the effect of H. chilense. 
In several instances, the two Aegilops cytoplasms af- 
fected the metabolism in an opposite manner, while not 
showing significant alterations in stress-associated pro- 
cesses (Figures 6 and 7, Additional file 6: Table S6). 
Yet, a decreased level of intermediates associated with 
the TCA cycle (citrate, fumarate and malate) character- 
ized both lines. 

In the Aegilops series under HL, only a few metabolites 
were significantly altered, and as observed under LL, 
Ae. uniaristata and Ae. tauschii had opposite trends 
on some of the most significantly altered metabolites, 
e.g., homoserine and 5-caffeoylquinate (Figures 6 and 7, 
Additional file 6: Table S6). More significant metabolic 
alterations were found in the comparison between the 
H. chilense alloplasmic line and the corresponding euplas- 
mic control under HL (Figure 5). In the H. chilense allo- 
plasmic line, HL-induced alterations included a drop in 
the abundance of amino acid alanine, aspartic acid, 
homoserine and serine, compared to the euplasmic line. 
Significant increases in content were observed in sugars 
(except for arabinose and turanose), sugar phosphates 
and mannitol. The increase in glycine, glucose and 
most of the sugar phosphates in the H. chilense allo- 
plasmic line could also reflect an enhancement in the 



photorespiratory metabolism, a finding supported by 
the up-regulation in the same alloplasmic line of sev- 
eral key genes of the photorespiration, e.g., glycolate 
oxidase (GOX), serine hydroxyl- methyl transferase 
(SHMT) and glutamine synthetase (GS). The remark- 
able enhancement in raffinose abundance (33-fold) and 
its downstream metabolite galactose is suggestive of a 
stress response. Under HL conditions, organic acids 
(butanoate-2, 4-dihydroxy, succinate, glycerate, quinate 
and shikimate) were at lower levels in the H. chilense 
alloplasmic line. Succinate can be produced from 
within the TCA cycle or by the conversion of GABA to 
sucinyl-semialdehyde and succinate, a reaction governed 
by GABA-transaminase and succinic semialdehyde- 
dehydrogenase; the former gene was shown to be down- 
regulated at the transcript level (Ta.635.2.Al_at, Ta.635.2. 
Al_x_at and Ta.635.3.Sl_a_at). The analytical details are 
reported in Figure 5 and Additional file 4: Tables S4 and 
Additional file 5: Table S5. The decrease in shikimate in- 
termediates, quinate and shikimate, may have supported 
the production of secondary metabolites. 

The direct comparison between the transcriptomic 
and metabolomic data in the H. chilense alloplasmic line 
exposed to HL allows the identification of some corre- 
spondences between changes in the expression of genes 
coding for enzymes and the corresponding metabolites 
(Figure 8). The most clear was the up-regulation of 
phospho-glucomutase (EC 5.4.2.2), sucrose synthase (EC 
2.4.1.13) and |3-fructofuranosidase (3.2.1.26) mRNAs that 
were associated with an increased accumulation of 
sugars and sugar phosphates. 

Substitution of a wheat cytoplasm with an H. chilense 
cytoplasm had no effect on fatty acid accumulation 

Following the alteration of enzymes involved in lipid 
biosynthesis in the gene expression profiling of the allo- 
plasmic line carrying the H. chilense cytoplasm, a fatty 
acid analysis was carried out for this alloplasmic line and 
its euplasmic control. Eleven detectable fatty acids were 
annotated with linolenic acid (18:3co3) having the highest 
level in the alloplasmic line and its euplasmic control, 
compare to the other annotated fatty acids. Nevertheless, 
no significant differences were detected between the 
alloplasmic line and the euplasmic control grown at HL 
(Additional file 7: Table S7). The possible explanation 
for the differences in the fatty acid regulation at the 
metabolite and gene level could be that there are yet un- 
known regulatory genes that may participate in a post- 
translational process and hence do not affect the final 
metabolite level. 

Discussion 

The development of alloplasmic lines via alien cytoplasm 
substitution provides a unique tool with which to study 
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Figure 5 Metabolites altered in the TH237 (H. chilense cytoplasm) alloplasmic line. A-F denotes amino acids, sugars, sugar phosphates, 
sugar alcohols, organic acids, and N-compounds, respectively. Metabolites in each category that were significantly (p < 0.05) altered in either 
low light (LL, 150 uE rrf 2 s" 1 ) or high light (HL, 600 uE irf 2 s" 1 ) in the mature leaf of the alloplasmic line TH237 (H. chilense cytoplasm) in 
comparison with the wheat accession T20 used as the euplasmic control. Bars are fold changes of the alloplasmic line on the euplasmic control. 
The euplasmic control is given the threshold 1. Bars above or below the dashed line (threshold value 1) indicate an increase or a decrease of 
each metabolite in the alloplasmic genotype compared with the euplasmic control. Each bar is presented as means ± SEM of five biological 
repetitions. Glc, glucose; glcl, 6-anhybeta, glucose-1, 6-anhydro (5. 



the nuclear-cytoplasmic interactions in plants. Hexaploid 
wheat resulted from a combination of three nuclear ge- 
nomes (A genome from T. urartu, B genome from Ae. 
speltoides and D genome from Ae. tauschii) with a cyto- 
plasm genome from Ae. speltoides [63,64]. The crossabil- 
ity of hexaploid wheat with most Triticeae species has 
permitted the development of a panel of alloplasmic 
lines where the wheat nuclear genome has been coupled 



with cytoplasm genomes from species with different de- 
grees of phylogenetic relatedness. The current work in- 
vestigated the transcript and the metabolite profiling of 
three wheat alloplasmic lines carrying the cytoplasm of 
two very closely related species {Ae. uniaristata and Ae. 
tauschii) and one of a more distant species (H. chilense). 
The results showed that the H. chilense cytoplasm had a 
greater effect on the transcriptomic and metabolite 
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Figure 6 Metabolites altered in the T183 (Ae. uniaristata cytoplasm) alloplasmic line. A-D denotes amino acids, sugars, organic acids, and 
N-compounds, respectively Metabolites in each category that were significantly (p < 0.05) altered in either low light (LL, 150 uE irf 2 s" 1 ) or high 
light (HL, 600 uE rrf 2 s~') in the mature leaf of the alloplasmic line T183 with the Ae. uniaristata cytoplasm in comparison with the wheat cv. Chris 
used as the euplasmic control. Bars are fold changes of the alloplasmic line on the euplasmic control. The euplasmic control is given the 
threshold 1. Bars above or below the dashed line (threshold value 1) indicate an increase or a decrease of each metabolite of the alloplasmic genotype 
compared with the euplasmic control. Each bar is presented as means ± SEM of five biological repetitions. GPG, glycerophosphoglycerol. 



profiles of its euplasmic control than did the Aegilops 
cytoplasms on theirs. It has been suggested that the 
negative effects produced by substituting a native cyto- 
plasm with a cytoplasm from a more or less related spe- 
cies are dependent on the evolutionary distance between 
the two species [4,13]. These features were generally 
considered as the consequence of an incompatibility 
between nuclear and organelle-encoded components of 
mitochondria and/or chloroplast after their evolution 
into separate species. During evolutionary processes, 
nuclear genomes may have accumulated mutations, 
while organelle genomes accumulated compensatory 
mutations and vice versa. Therefore, the juxtaposition of 
cytoplasm and nucleus from distantly related genera 
exposes those non-compensated mutations that lead to 
detrimental effects on those traits determined by nuclear- 
cytoplasmic interactions [4]. Hence, the replacement of 



the wheat cytoplasm with the cytoplasm of the closely 
related species Aegilops had a limited impact on gene 
expression and central metabolism. In contrast, the 
cytoplasm of the phylogenetically more distant species 
H. chilense altered the expression of functionally relevant 
genes and led to impaired cellular functioning, which in 
turn, activated stress-related processes at the metabolite 
level. Nevertheless, the changes at the transcript level of 
the lipid biosynthetic genes were not reflected in the 
abundance of fatty acid. This discordance in omics data is 
not exclusive to this study. In fact, an extensive review by 
Fernie and Stitt [65] postulated that quite often, there is 
little linear relation between transcriptomic and metabolo- 
mic data, implying a higher level of the regulation and 
structure of metabolic networks. We suggest that phylo- 
genetic relatedness infers the degree of compatibility 
between nucleus and cytoplasm genomes, and in turn, the 
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Figure 7 Metabolites altered in the T19S [Ae. tauschii cytoplasm) alloplasmic line. A-E denotes amino acids, sugars, sugar phosphates, 
sugar alcohols, and organic acids, respectively. Metabolites in each category that were significantly (p < 0.05) altered in either low light (LL, 150 
uE rrf 2 s" 1 ) or high light (HL, 600 u£ rrf 2 s" 1 ) in the mature leaf of the alloplasmic line T195 (Ae. tauschii cytoplasm) in comparison with the wheat 
cv. Chris used as the euplasmic control. Bars are fold changes of the alloplasmic line on the euplasmic control. The euplasmic control is given the 
threshold 1. Bars above or below the dashed line (threshold value 1) indicate an increase or a decrease of each metabolite of the alloplasmic 
genotype compared with the euplasmic control. Each bar is presented as means ± SEM of five biological repetitions. 



co-evolution of nucleus and cytoplasm has shaped the 
regulatory mechanisms that control the cross-talk be- 
tween the cellular genomes. 

Specific signaling mechanisms are affected by cytoplasm 
substitution 

The present study was carried out in the fully expanded 
leaves of plants grown at 600 uE m~ 2 s" 1 , a condition that 
enables the highlighting of differences due to the activa- 
tion of light-stress-associated processes known to be rele- 
vant for chloroplast and mitochondrion functionality. 
Since the cross-talk between nucleus and cytoplasm 



genomes is controlled by defined anterograde and retro- 
grade signaling mechanisms [66], the alterations detected 
with transcriptomic and metabolomic analysis can be used 
to highlight specific signaling pathways sensitive to small 
variations in the cytoplasm genome. 

The plastid transcripts undergo an extensive and com- 
plex maturation process that includes the splicing of in- 
trons and RNA editing which require specific proteins 
encoded exclusively in the nucleus. Most introns in or- 
ganelles are group II introns, whose catalytic mechanism 
closely resembles that of the nuclear spliceosome. The 
plant nuclear genome contains four organelle maturase- 
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Figure 8 Metabolic pathway map for the alloplasmic genotype TH237 (H. chilense cytoplasm) subjected to high light treatment. 

Each data point is the fold change of the alloplasmic genotype on the euplasmic control. Circles and rectangles denote measured metabolites 
and metabolic genes, respectively. Blue, red and grey points indicate significant (p<0.05) increases, decreases and no change of metabolite and 
enzyme levels, respectively. Metabolic genes measured are mapped and indicated with enzyme EC numbers: arginine decarboxylase (4.1.1.19), 
asparaginase (3.5.1.1), glutamine synthetase (6.3.1.3), aspartyl-phosphate (2.7.2.4), threonine synthase (4.2.3.1), sucrose synthase (2.4.1.13), phospho- 
glucomutase (5.4.2.2), and p-fructofuranosidase (3.2.1.26). 



related genes, out of the context of their evolutionary- mitochondria/chloroplast [28] and act in-trans in the 
related group-II introns dedicated to the nuclear genes splicing of organellar-encoded introns. The finding that 
[67]. All the encoded proteins are targeted to two (AtnMat3 and AtnMat4) out of four nuclear 
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maturase-related genes were up-regulated in all alloplasmic 
lines identifies the anterograde signaling that controls 
the RNA-maturation-related processes as one of the most 
sensitive to a perturbation of the nuclear-cytoplasmic 
interaction. Although no specific targets have been dem- 
onstrated so far for AtnMat3 and AtnMat4, correlative 
evidence supporting a defective maturation process for 
some cytoplasmic-encoded genes can be found in the 
down-regulation of five chloroplast-encoded genes in all 
alloplasmic lines, a trend opposite to the maturase 
up-regulated expression. 

Although no chloroplast retrograde pathway is well- 
understood mechanistically, several signals of various na- 
tures have been reported to trigger retrograde signaling 
from chloroplasts to nucleus, among them the signals gen- 
erated by ROS accumulation. The different forms of ROS 
cause similar cellular damages, but each of them is in- 
volved in the activation of a specific signaling pathway 
[30]. High light conditions, as those employed in this 
work, are known to generate ROS, and consequently, the 
results from our study identifying a number of modifica- 
tions can be linked to ROS-mediated retrograde signaling 
pathways. Light acclimation processes in plants act to dis- 
sipate the excess excitation energy and optimize photosyn- 
thesis under variable light intensity. Under high light 
conditions, the excitation energy might not be sufficiently 
quenched by the PSII reaction center or by carotenoids, 
promoting the formation of the triplet-excited state of the 
chlorophyll and leading to the generation of x 0 2 [68,69]. 
We found that the retrograde signaling pathway triggered 
by O2 was specifically altered in the alloplasmic line car- 
rying the H. chilense cytoplasm. Transcriptomic analysis 
revealed an up-regulation of flu and ex2, two of the key 
genes identified as essential and specific components of 
the x 0 2 signaling. 

Flu is a negative regulator of chlorophyll biosynthesis 
that prevents the accumulation of intermediate that are 
potentially extremely destructive when illuminated, caus- 
ing a massive release of 1 0 2 . The accumulation of 1 0 2 
leads to rapid and selective transcriptional reprogram- 
ming and finally induces programmed cell death in flu 
plants. The primary effect of 1 0 2 generation in the Ara- 
bidopsis flu mutant is the activation of a broad range of 
signaling pathways known to be involved in biotic and 
abiotic stress responses [36,37]. Flu and ex are considered 
specific markers of the 1 0 2 pathway; therefore, their 
up-regulation demonstrates that a change in the chloro- 
plast genome has a direct impact on the chloroplast 
retrograde signaling controlled by x 0 2 . The up-regulation 
of Dl can also be interpreted as a consequence of the 
production of 1 0 2 which, in turn, promotes the irrevers- 
ible oxidation of the Dl protein [68]. In cyanobacterial 
cells, singlet oxygen has been shown to cause direct 
photo-damage to PSII and the Dl protein [70,71] and to 



prevent PSII repair by suppressing the elongation of the 
Dl protein [72]. 

A number of ^Z^-downstream genes were also found 
up-regulated in the H. chilense alloplasmic line. LSD1 is 
a key gene controlling the cross-talk between ROS and 
SA-dependent pathways, leading to light acclimation 
and pathogen defense responses [44]. The up-regulation 
of the LSD1 gene, a negative regulator of cell death, in 
the H. chilense alloplasmic line was associated with the 
altered expression of 26 probe sets involved in the 
hypersensitive response to biotic stress belonging to me- 
thyl jasmonate and salicylic acid pathways (Table 1) and 
with the down-regulation of several genes involved in 
programmed cell death. These results suggest that a 
change in the chloroplast genome has a direct impact on 
the chloroplast retrograde signaling that integrates light 
acclimation, cell death and immune defense responses, 
including genes engaged in ethylene, ROS, salicylic acid, 
glutathione, ABA, sugar and auxin signaling. 

In line with the observation reported above, several sig- 
nificant metabolic alterations found in the H. chilense allo- 
plasmic line can be associated with a typical response to 
high light mediated by the ROS signal [73]. It is known 
that soluble sugars, especially sucrose, glucose and fruc- 
tose, are involved in the responses to a number of stresses, 
and they act as nutrient and metabolite signaling mole- 
cules that activate specific hormone-cross-talk transduc- 
tion pathways, thus resulting in important modifications 
of gene expression and proteomic patterns. Various meta- 
bolic reactions and regulation directly link soluble sugars 
with the production rates of ROS, such as mitochondrial 
respiration and photosynthesis regulation and, conversely, 
with anti-oxidative processes. A metabolic analysis of the 
H. chilense alloplasmic line showed significant increases in 
sugars, sugar phosphates and mannitol, with a remarkable 
enhancement in raffinose abundance (33-fold) and of its 
downstream metabolite galactose. The accumulation of 
raffinose is in agreement with the up-regulation of two 
key enzymes (sucrose synthase and |3-fructofuranosidase) 
of the raffinose metabolism (Figure 8). These oligosaccha- 
rides are associated with a number of stresses in plants 
[74] e.g., drought, salt, temperature [75] and light [76]. 
They function mainly as osmoprotectants, and raffinose is 
known to play a key role in ROS scavenging and the pro- 
tection of the metabolism in chloroplasts from the oxida- 
tive damage caused by HL [77] . The increased abundance 
of a number of polyols could further support a limited re- 
sponse to light of the alloplasmic line [78,79]. For ex- 
ample, polyols are reported to play an important role in 
the protection of sensitive enzymes and membranes from 
ROS in the cytoplasm [80]. Taken together, these results 
suggest that, the H. chilense alloplasmic line is likely im- 
paired in electron dissipation, and thus, exposure to light 
leads to enhanced ROS generation and the induction of 
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ROS-scavenging processes and the accumulation of mem- 
brane protective compounds. 

Cytoplasm male sterility-associated genes are modulated 
in response to cytoplasm substitution 

Although the alloplasmic lines here tested were all fully 
male fertile, the finding that some genes known to be in- 
volved in the determination of CMS have an altered ex- 
pression due to the substitution of a wheat cytoplasm 
with the cytoplasm of H. chilense further demonstrates 
the involvement of the nuclear-cytoplasmic cross-talk in 
CMS, as already reported for CMS that originated after 
inter-species crosses [10,81,82]. It can be suggested that 
in the specific H. chilense alloplasmic line here employed 
(TH237), the mechanisms leading to CMS have been 
only partially modified without a phenotypic effect, while 
a more strong de-regulation of the genes here described 
(or of some of them) might lead to male sterility. Indeed, 
the male fertility of the H. chilense alloplasmic lines var- 
ies depending on the H. chilense accession used. Some 
H. chilense accessions give full fertility, while others lead 
to male sterility as was the case of the HI accession [10]. 
The fertility restoration of CMS in the msHl system is 
caused by genes located on chromosome 6H ch [10,11]. 
Notably, ten probe sets, corresponding to genes modu- 
lated in the H. chilense alloplasmic line and related to 
CMS (LOX2, acyl CoA reductase, CER4 and POP2), 
matched to Brachypodium genes from chromosome 3 in 
the region syntenic with barley chromosome 6H [83]. 
One of them (LOX2) also found a match in the barley 
genome zipper on chromosome 6H (http://mips.helm- 
holtz-muenchen.de/ plant/barley/ gz/ search) sp/ index.) sp) . 
These genes may be good candidates by which to inves- 
tigate the restoration of CMS in the msHl system. 

Conclusions 

The replacement of the wheat cytoplasm with the cyto- 
plasm of a related species affects the nuclear-cytoplasmic 
cross-talk leading to transcript and metabolite alterations. 
The alteration of the nuclear-cytoplasmic cross-talks 
might reflect an impaired light response capacity that ex- 
poses the alloplasmic lines to some degree of stress, as 
suggested by the altered expression of ROS -responsive 
genes and the accumulation of stress-related metabolites. 
The extent of these modifications was limited in the allo- 
plasmic lines with the Aegilops cytoplasm, and more rele- 
vant in the alloplasmic line with the H. chilense cytoplasm. 
We consider that, this finding might be linked to the 
phylogenetic distance of the genomes. 

Methods 

Plant material and growth conditions 

Three alloplasmic lines (T183, T195 and TH237) were 
used in this work. The alloplasmic lines T183 and T195 



were developed through the introgression of the cyto- 
plasm from Aegilops uniaristata (T183) and Aegilops 
tauschii (T195) in the nuclear background of Triticum 
aestivum cv. Chris by Prof. S.S. Maan (North Dakota 
State University, USA) after 10 backcrosses as described 
by Busch and Maan [23]. The alloplasmic line TH237 
was produced by introgressing the Hordeum chilense ac- 
cession H7 cytoplasm into the nuclear background of 
T. aestivum accession T20 [13]. The seeds of the allo- 
plasmic lines were multiplied at IAS-CSIC and verified 
for cytoplasm identity with specific cytoplasm markers. 
T183 and T195 were checked using the cleaved ampli- 
fied polymorphic sequence (CAPS) psbE & psbF-Hpall 
[64] that distinguishes between the T. aestivum and 
Aegilops cytoplasm, while TH237 was confirmed using 
a ccSSR marker [84] as described by Atienza et al. 
[13]. The alloplasmic lines have overall normal pheno- 
types that differ from their euplasmic control only for 
a few phenotypic traits (e.g., plant height) and yielding 
capacity [13]. 

Alloplasmic line TH237 was produced by transferring 
the Hordeum chilense accession H7 cytoplasm into the 
nuclear background of the Triticum aestivum accession 
T20 [13]. Since H. chilense and wheat chromosomes do 
not pair during meiosis, the alloplasmic line was ob- 
tained after two backcrosses as previously reported [13]. 
The euplasmic backgrounds were used as the control. 
Fifty seeds were sown in five biological replicates for 
each sample in pots of 11 cm diameter and grown in 
controlled conditions under two light intensities, 150 
and 600 uE m~ s" , denoted low and high light intensity, 
respectively, in a daily regime of 12 h light at 22°C and 
12 h darkness at 15°C. Plants were bulked from each 
pot, and three biological replicates were used for the 
transcriptomics, while five biological replicates were 
used for the metabolic profiling. Fully expanded second 
leaves were collected two weeks from germination in the 
middle of the light period and frozen in liquid nitrogen 
until further analysis. 

RNA isolation and array hybridization 

RNA samples from the leaves of alloplasmic and the cor- 
responding euplasmic plants grown under 600 uE m~ s' 
were prepared using a TRIZOL reagent according to the 
method published at the Arabidopsis Functional Genom- 
ics Consortium (http://www.arabidopsis.org/portals/masc/ 
AFGC/RevisedAFGC/site2RnaL.htm#isolation) and fur- 
ther cleaned using RNeasy columns according to the Qiagen 
RNeasy Mini Handbook. RNA quantity and integrity 
were confirmed on an Agilent Bioanalyzer 2100 using the 
RNA 6000 nano Kit RNA samples were processed follow- 
ing the AffymetrixGeneChip Expression Analysis Technical 
Manual (Affymetrix, Inc., Santa Clara, CA). Single-stranded, 
then double-stranded, cDNAs were synthesized from the 
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poly (A) mRNA isolated from 5 ug of total RNA for each 
sample using the Affymetrix One-Cycle Labelling Kit and 
Control reagents. The resulting ds-cUNA was column puri- 
fied and then used as a template to generate biotin-tagged 
cRNA from an in vitro transcription reaction (IVT), using 
the Affymetrix Gene-Chip IVT Labelling Kit. The resulting 
biotin-tagged cRNA was fragmented and then hybridized at 
45°C for 16 h (Affymetrix Gene-Chip Hybridization Oven 
640) to the probe sets present on an Affymetrix Gene-Chip 8 
Wheat Genome Array. The arrays were washed and 
then stained (SAPE, Streptavidin-phycoerythrin) on an 
Affymetrix Fluidics Station 450, followed by scanning 
with a Gene-Chip Scanner 3000. 

Hybridization quality was verified using the standard 
Affymetrix controls, and all hybridizations showed the 
expected checkerboard pictures. The collected data were 
normalized using an RMA algorithm. The average back- 
ground was 40.07, well within recommended levels. The 
percentage of "present" calls ranged between 42.9% and 
46.9% of the 60 K probe sets of the array. To value the 
quality of biological replicates, R-squared was calculated 
between the replicates of the same sample, and the 
values ranged between 0.97 and 0.99 with an average 
value of 0.976. 

Wheat microarray design and expression profiling data 
are available in Plexdb (http://www.plexdb.org) as ex- 
periment TA49: "alteration in transcript level in wheat 
alloplasmic lines". 

Transcript data processing and analysis 

Raw intensity values were normalized by RMA [85], 
using the R package Affymetrix library [86] . The same li- 
brary was used to run the MAS 5.0 algorithm on raw 
data to produce a detection call for each probe set. 
These detection calls (P = "present", M = "marginal" or 
A = "absent") were used to apply an initial filtering step, 
since genes not expressed ("absent") represent experi- 
mental noise and can generate false positives. All probe 
sets that didn't show "present" calls in all three reps of 
at least one sample were removed. R-squared linear cor- 
relation coefficients were computed on the RMA expres- 
sion values (log 2 -transformed) for each set of biological 
triplicates. RMA-filtered data were imported to the 
Genespring GX 7.3 (Agilent Technologies) software for 
all subsequent analyses. Three comparisons were done 
to evaluate the impact of different cytoplasms on nuclear 
gene expression at 600 uE m' 2 s" : Ae. tauschii (alloplas- 
mic line) vs. Chris (euplasmic control), Ae. uniaristata 
(alloplasmic line) vs. Chris (euplasmic control), and 
H. chilense (alloplasmic line) vs. T20 (euplasmic control). 
Differentially expressed probe sets were identified 
through a Welch t-test with a Benjamini and Hochberg 
false discovery rate correction for multiple tests [87]. 
Differences in gene expression were considered to be 



significant when the p < 0.05 and the induction or re- 
pression ratio was equal to or higher than two-fold. A 
Principal Component Analysis (PCA), a mathematical 
procedure that uses orthogonal transformation to con- 
vert a set of observations of possibly correlated variables 
into a set of values of linearly uncorrelated variables 
called principal components, is often used to analyse 
array and metabolomic data [25]. A PCA was carried 
out with Genespring GX 7.3 software and employed to 
assess the contribution of the genetic backgrounds and 
of the cytoplasms on the observed variations in gene ex- 
pression. The array data were analysed using a two-fold 
change as the cut-off, followed by an analysis for statisti- 
cally significant changes using a Welch t-test and a false 
discovery rate correction for multiple testing [87]. 

Blast searches were done using a HarvEST Affymetrix 
Wheatl Chip 1.50 (http://www.harvest.ucr.edu), and 
only the annotations of wheat probe sets with a hom- 
ology level cut-off equal to or lower than E-value = e" 1 
were considered. The functional classification was based 
on the wheat gene identifier categories (Affymetrix 
Gene-Chip Wheat probe Taes_0709) reported by the 
MapMan 3.1.0 software (http://gabi.rzpd.de/projects/ 
MapMan/), a user-driven tool for displaying genomics 
datasets on diagrams of metabolic pathways and other 
biological processes [26]. The log2 expression ratio of 
genes modulated in the comparison between alloplasmic 
line TH237 (H. chilense cytoplasm) and T20 euplasmic 
control were loaded into the MapMan Image Annotator 
Module to map the wheat transcriptome data into define 
functional categories and to display them onto pathway 
diagrams. In the color scale, red represents a lower gene 
expression and blue represents a higher gene expression 
in the H. chilense alloplasmic line, as compared with the 
corresponding T20 euplasmic. The complete set of genes 
submitted to the MapMan analysis is given in Additional 
file 1: Table SI. 

Probe set descriptions, functional classification and mito- 
chondrial/chloroplast localization were then verified and 
implemented manually and by other bioinformatics re- 
sources, such as MIPS Arabidopsis thaliana database 
(MAtDB) Functional Catalogue (FunCat) (http://mips. 
helmholtz-muenchen.de/proj/funcatDB) and Plexdb (plant 
expression database http://www.plexdb.org). 

Validation of array data with real-time reverse transcrip- 
tion PCR (qRT-PCR) 

An array validation was carried out with six genes {AtnMatS, 
Executerl, Flu, psbM, FAR, LSD1, Additional file 2: 
Table S2), selected because they are representative of 
key pathways altered in the alloplasmic lines. Total 
RNA (400 ng) was reverse transcribed using gene spe- 
cific primers with Superscript II (Invitrogen) according 
to the manufacturer's protocol. Subsequently, the cDNAs 
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were used for q-PCR amplifications with SYBR Green 
fluorescence detection. The reactions (final volume of 
25 ul) were set with a QuantiFast SYBR Green PCR Master 
Mix (Qiagen) together with forward and reverse primers 
(10 mM; 2.5 ul each, see Additional file 2: Table S2) and 
amplified using standard cycling conditions: 95°C (10 min) 
and 40 cycles of amplification (95°C for 30 s and 60°C 
for 1 min). Expression was determined for three analytical 
replicates. The wheat polyubiquitin gene (Ta.24299.1.Sl_at) 
was used as a reference. 

Metabolic profiling using GC-MS-based method 

Metabolite analyses from the leaves of alloplasmic and 
the corresponding euplasmic plants grown at 150 
and 600 uE m' 2 s" 1 were carried out using a GC-MS- 
based protocol described by Lisec et al. [88] and 
Roesnner et al. [89] with a few modifications. Leaf 
tissues (100 mg) were homogenized using a ball mill 
pre-cooled with liquid nitrogen and extracted in 1400 ul 
of methanol, 60 ul of 0.2 mg ribitol ml - in methanol as 
an internal quantitative standard for the polar phase. 
The mixture was extracted for 15 min at 70°C, mixed 
vigorously, and centrifuged for 10 min at 14000 rpm. 
The supernatant was transferred to clean 2 ml tubes. In 
order to separate polar and non-polar metabolites, 
750 ul chloroform and 1500 ul of water was added to 
the supernatant. After centrifugation at 4000 rpm for 
15 min, 200 ul of the upper methanol/water phase was 
transferred to new tubes and reduced to dryness in 
vacuo. Dried samples were re-dissolved in 40 ul of 
20 mg ml - methoxyaminehydrochloride in pyridine and 
derivatized for 120 min at 37°C followed by a 30 min 
treatment with 70 ul MSTFA (Af-methyl-AT- [trimethylsi- 
lyl] trifluoroacetamide) at 37°C and 8 ul of a retention 
time standard mixture (3.7% [w/v] hepatonic acid, 3.7% 
[w/v] nonanoic acid, 3.7% [w/v] undecanoic acid, 3.7% 
[w/v] tridecanoic acid, 3.7% [w/v] pentadecanic acid, 
7.4% [w/v] nonadeanoic acid, 7.4% [w/v] tricosanoic 
acid, 22.2% [w/v] heptacosanoic acid, and 55.5% [w/v] 
hentriacontanoic acid dissolved in 10 mg ml _1 tetra- 
hydrofuran) was added before trimethylsilylation. Sample 
volumes of 1 ul were then injected with a split ratio of 
25:1, using a hot needle technique. The gas chromatog- 
raphy-mass spectrometry (GC-MS) system was com- 
posed of an AS 2000 auto-sampler, a GC 8000 gas 
chromatograph, and a Voyager quadrole mass spectrom- 
eter (Thermo-Quest, Manchester, UK). The mass spec- 
trometer was tuned according to the manufacturer's 
recommendations, using tris-(perfluorobutyl)-amine 
(CF43). GC was performed on a 30-m SPB-50 column 
with 0.25-um film thickness (Superlco, Bellfonte, CA). 
The injection temperature was set at 230°C, the inter- 
face at 250°C, and the ion source adjusted to 200°C. 
Helium was used as the carrier gas at a flow rate of 



1 ml min -1 . The analysis was performed under the 
following temperature program: 5 min of isothermal 
heating at 70°C, followed by a 5°C min -1 oven 
temperature ramp to 310°C, and a final 1 min of 
heating at 310°C. The system was then temperature 
equilibrated for 6 min at 70°C before the injection of the 
next sample. Mass spectra were recorded at 2 scan 
sec -1 with a scanning range of 50 to 600 m/z. Data was 
analysed using Tagfinder software [90] and statistical 
analysis including an ANOVA, a student t-test and a 
principle component analysis that were performed 
using dedicated software, Microsoft Excel, R software 
environment (http://www.R-project.org; R Development 
Core Team, 2004) and TMEv (http://www.tm4.org/ 
mev/) [91]. 

Data analysis using principal component analysis 

Principal component analysis (PCA) visualizes the disper- 
sion of metabolites/genes and the formation of groups of 
co-dispersed metabolites/genes. A PCA was performed 
with the software package TMEv [91] for metabolite data 
using the default weighted covariance estimation function. 
PCA is an unsupervised multivariate method that allows 
patterns, trends, groups and outliers in a large datasets to 
be easily identified. The dimensionality of complex data is 
reduced to what are called Principal Components (PCs) 
that retain the maximal amount of variation within a sam- 
ples' population. For PC and average based statistical ana- 
lysis, the data was login transformed and normalized to 
the median of the entire sample set for each metabolite 
before analysis. This transformation reduces the influence 
of outiiers and increases symmetry in a dataset [25]. 

Availability of supporting data 

The datasets supporting the results of this article are in- 
cluded within the article and its additional files. 



Additional files 



Additional file 1: Table SI. Probe sets detecting transcripts more than 
two-fold up-or down-regulated in the comparisons between each 
alloplasmic line and the corresponding euplasmic control. 

Additional file 2: Table S2. Genes selected for array validation and the 
corresponding primers employed in the qRT-PCR analysis. Expression data 
detected with microarray and qRT-PCR analysis are reported for comparison. 

Additional file 3: Table S3. Data of two-way ANOVA metabolites that 
significantly changed in the alloplasmic line TH237 (H. chilense cytoplasm) 
and its euplasmic control T20 due to genotype, light treatment and the 
interaction effect between them. 

Additional file 4: Table S4. Data of two-way ANOVA metabolites that 
significantly changed in the alloplasmic lines T183 (Ae. uniaristata 
cytoplasm) and T195 (Ae. tauschii cytoplasm) and their euplasmic control 
(Chris) due to genotype, light treatment and the interaction effect 
between them. 
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Additional file 5: Table S5. Metabolite variations in the comparison 
between the alloplasmic line with the H. chilense cytoplasm (TH237) and 
the corresponding wheat euplasmic control (T20). Values expressed as 
fold change for plants grown at low light and high light. 

Additional file 6: Table S6. Metabolite variations in the comparison 
between the alloplasmic lines with Ae. uniaristata (T183) or Ae. tauschii 
(T195) and the corresponding wheat euplasmic control (Chris). Values 
expressed as fold change for plants grown at low light and high light. 

Additional file 7: Table S7. Fatty acids measured in the alloplasmic line 
TH237 (H. chilense cytoplasm) and its euplasmic control T20. Data was 
subjected to a student t-test to test for fatty acids that were significant 
(p<0.05) between the alloplasmic line and its euplasmic control. 
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